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Local softness, softness dipole and polarizabilities of 
functional groups: application to the side chains of 

the twenty amino acids 

Alisa Krishtal 1 , Patrick Senet 2 * Christian Van Alsenoy 1 

Abstract 

The values of molecular polarizabilities and softnesses of the twenty amino 

acids were computed ab initio (MP2). By using the iterative Hirshfeld scheme 

to partition the molecular electronic properties, we demonstrate that the values 

of the softness of the side chain of the twenty amino acid are clustered in groups 

reflecting their biochemical classification, namely: aliphatic, basic, acidic, sulfur 

containing, and aromatic amino acids . The present findings are in agreement 

with previous results using different approximations and partitioning schemes 

[P. Senet and F. Aparicio, J. Chem. Phys. 126,145105 (2007)]. In addition, 

we show that the polarizability of the side chain of an amino acid depends 

mainly on its number of electrons (reflecting its size) and consequently cannot 
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be used to cluster the amino acids in different biochemical groups, in contrast 
to the local softness. Our results also demonstrate that the global softness is 
not simply proportional to the global polarizability in disagreement with the 
intuition that "a softer moiety is also more polarizable" . Amino acids with the 
same softness may have a polarizability differing by a factor as large as 1.7. 
This discrepancy can be understood from first principles as we show that the 
molecular polarizability depends on a "softness dipole vector" and not simply 
on the global softness. 
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2 Introduction 

Classification of molecules in families depending on their functional groups is as old as 
chemistry itself. In principle, quantum mechanical calculations contain the necessary 
information to evaluate the chemical (quantum) similarity of any two molecules or 
any two fragments within a set of different molecules. Therefore ab initio calculations 
should be able to recover the empirical classifications of chemistry and more generally 
to predict the "similarity" between two molecules in a chemical library. The later 
application is important in pharmacology where ab initio calculations can be used to 
discriminate a potential drug within a large set of molecules.^ 

In the present work, one shows how certain descriptors defined in Density Func- 
tional Theory (DFT) can be used to discriminate a cluster of molecules within a 
"family" , where a family is defined as a set of molecules having a fragment in com- 
mon. Two members of such a family differ from each other by a "variable fragment" . 
For example, the amino acids form a family because each member has a fragment in 
common with the others (the amino-acid part or backbone) and a variable fragment 
(side chain). One confirms below that the concepts of local softness^ s(r) allow to 
identify different subgroups of "similar" molecules within a family. 

The local softness s(r) can be computed at any point in space and is proportional 
to the change in the electronic density p(r) induced by a shift SN in the number 
of electrons iV of a molecule The integration of the local softness of an isolated 
molecule over all the space is the (global) molecular softness S = J dr s(r). The 
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molecular hardness is defined as the inverse of (global) softness r] = l/SPS Both 
softness and hardness have been extensively studied in recent yearsP^ 
Local softness can be computed by using the following relation^ 



where \x is the molecular chemical potential and v(r) the external potential. The 
functional derivative in Eq. (TTJ) is evaluated at constant N. In the present paper, one 
applies the "frozen orbital approximation" to evaluate the derivative in Eq. (pQ) and 
one chooses the chemical potential fi as the average between the energies of the highest 
occupied (HOMO) and lowest unoccupied (LUMO) molecular orbitals, ™ 3 ™ 51 i.e. 
P = (zlumo + £homo) /2. The hardness r\ is defined as the difference between the 
energies of the frontier orbitals, also named the HOMO-LUMO gap, i.e. r\ — 1/S — 
£lumo — £HOMO- ie Within this approximation, one has the usual relation 

s ^ = \Plumo{y) + Phomo(t)} ^2) 
2 [clumo ~ £homo] 

For an isolated molecule, the spatial variation of s(r) is entirely due to the varia- 
tions of the HOMO/LUMO frontier orbitals because the HOMO-LUMO gap remains 
constant. On the contrary, if one wishes to compare the values of s(r) between two 
members of the same family also the difference in their HOMO-LUMO gaps is of 
importance. 

The variations of s(r) within a molecule, or its comparison between two molecules, 
are more easily analyzed by using a coarse-grained representation of this function ob- 



tained by partitioning the electronic density into fragmentsP^H2H Within a fragment k, 
the function s(r) is replaced by a single value Sfc, called the "condensed softness"^ and 
computed by using one of the partitioning schemes of the electronic properties'^ 
In a recent work, one of us demonstrates that such condensed softness is correctly 
described as a polarization of the fragment k by an effective potential produced by 
the rest of the moleculeP^H The concept of Coulomb hole, recently introduced, is 
a measure for the amount of charge induced by one fragment on the another. 
More precisely, for a family with a common fragment 1 and a variable fragment 2, 
one demonstrated that the softness S of any molecule of the family is related to the 
softness of its functional group (52) by a linear relation' 

S = s 2 [l-q h g (l)]+S~ 1 , (3) 

where q^(l) defines the average Coulomb hole of the molecular family and has the 
meaning of an "induced charge" within fragment 2 by one hole (charge +e) on frag- 
ment 1 [See Eq. (13) in Ref. (120]) ]. Eq. (J3j) allows to compute the average softness S\ of 
the common fragment of a family of molecules and allows to evaluate the similarities 
and differences between the individual members of the family: for the amino acids, 
the positions of the molecules with similar functional groups were found close to each 
other on the (S,s 2 ) map' 

Another important molecular descriptor of the electronic properties of a molecule 
is its polarizability. Softness and polarizability are assumed to be related: "a soft 
species is also more polarizable" P But how to compare the local polarizability and the 
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local softness (or frontier orbitals) of fragments within a molecule from first principles? 
To answer this question, the local polarizability of a molecule is defined (Section 2.1.) 
as follows: 

dp(r 



oiij[T) = -en 



dE* 



(4) 



N 



where i and j stand for Cartesian directions and E is a uniform electric field applied to 
the molecule. In Eq. as well as in all equations below, the vector r represents the 
position relative to the center of mass of the molecule, i.e. the origin of the Cartesian 
axis is chosen as the center of mass of the molecule by convention. 

The applied field E is derived from an external potential Av (r) = e £\ r j^j ( e = 
1.602 1CT 19 C). As shown in the next Section, the local polarizability tensor is related 
to the local softness by 



a<y(rj 



s[Y)eri 
S 



dr's(r')er'j 



(5) 



where a^-(r) is the local polarizability tensor at a constant electronic chemical poten- 
tial. The molecular polarizability is found by integrating the local polarizablity: 



atij = I dr otij(r). 



(6) 



The isotropic part of the molecular polarizability tensor is in obvious notations (Sec- 
tion 2.1.) 



a 



\Jdrs{ 



r er 



3S 



(7) 



The numerator of the last term in Eq. (|7j) is the square length of the "softness dipole 
vector", D = J drs(r)er. 

The local polarizability tensor c%(r) can also be partitioned into fragments. In 
fact, the partitioning of the polarizability of a molecule into a sum of polarizabilities of 
its parts has a long history. In some sense, the concept of "local polarizability" is quite 
old.^ Ab initio calculations have established that the polarizability (more precisely 
the trace of the polarizability tensor) of an atom or of a molecule is proportional to 
its volume pZES This explains some sucess of the "additive rule" of polarizabilities 
which consists of summing up the empirical values of the polarizabilities of the atoms 
or functional groups of a molecule to estimate its global polarizabilityP^GZI 

In the present study, local softness s(r) and local polarizability «jj(r) computed 
ab initio were obtained by applying the Hirshfeld scheme, which is based on a parti- 
tioning of the density of the electronsP^HHS] The partitioning of s(r) computed by 
applying the Hirshfeld method is compared with previous results^ where the Contr- 
eras et al.^ partitioning scheme, based on molecular orbitals, was applied. It should 
be emphasized that the present study provides the most extensive study of polariz- 
abilities and softness of isolated amino acids and of their side chains. The present 
work differs from earlier studies of reactivity descriptors of the subset of the twenty 
amino acids, which were mainly devoted to determine the protonation site in the 
amino acid region GSHUS] 

The paper is organized as follows. The theory and numerical methods are sum- 



8 



marized in the next section. Results are presented and discussed in Section 3. The 
paper ends with final concluding remarks in the last Section. 



3 Theory and methods 

3.1 Local polarizability 

The dipole of a molecule with M atoms is given by 

/M 
dr r p(r) + Z A eR A , (8) 
A=l 

where e is the elementary charge (e = 1.602 10 -19 C), Za is the atomic number of 
atom A and R A is its position relative to the center of mass of the molecule chosen as 
the origin of the Cartesian coordinates. The components of the dipole polarizability 
tensor of the molecule are given by 

10 dE j J dEj ' [ ) 

where % and j stand for Cartesian directions and E is a uniform electric field applied 
to the molecule. Therefore, the local polarizability of a molecule is defined as follows: 

Mr) = (10) 

The applied field E derives from an external potential v(r) = eYlj r j^j- The ten- 
sor OLij{r) is symmetrical because one can write: 



dpjv) 



dr' 



5p(r) 



5v(r') 
n SE i 



_5v(r') 

= J dv' Xl {vy)er' p (11) 
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where Xi is the so-called linear polarizability kernel with the property J <ir'x 1 (r,r / ) = 
OP The global polarizability is obtained by integration of the local polarizability over 
all the space 

J dra ij {r) = - J dr J dr'er i Xi(r,r')er}. (12) 



OLij 



The last equality in Eq. (TT2]) is a well-known relation between molecular polarizability 
and the symmetrical kernel xi [See for instance Ref. (1381) ]. 

The relation between the local polarizabity and the local and global softnesses can 
be established by using the so-called Berkowitz-Parr relation^ 



*v_ _/n , s ( r ) s ( r ') 



Xl (r,rO = X^(r,r / )+ v ^ y , (13) 

where Xi( r ) r ') is the polarizability kernel at a constant electronic chemical potential 
//P One has the relation Xi( r ) r = — s(r,r') where s(r, r') is the so-called softness 
kernel related to the local softness by integration: s(r) = J dv's(r, J3H3S1 Replacing 
Xi in Eq. (ITTj) by the right-hand side of Eq. ( TT3"j) . one obtains in obvious notations 
for the local polarizabity [Eq. ffTO]) ] 
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ay(r) = ag(r) - er iS (r) / dr'^-^, (14) 



and for the global polarizability [Eq. (JS}] 

J drs(r)eri j dr's(r')er'j 

= (15) 
where -Dj is the component of the softness dipole vector 

D = J drs(r)er, (16) 
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in the Cartesian direction i. The isotropic part of the polarizability tensor is 



Q^xx ~t~ ^yy ~~~ ^-z 



u 

a — 

ISO 



\j dr s(r)er|| 2 
3S ' 



D 2 

= a iso - ^g- ( 17 ) 

where D 2 is the square length of the softness dipole vector. It should be emphasized 
that the values of a^ (r) and of the softness dipole vector, as defined above, are defined 
for a molecule for which the center of mass is the origin of the Cartesian coordinates. 

Relation (TlTj) is interesting since it shows explicitely how polarizability is related 
to softness. A large S means a less negative contribution to the right-hand side of Eq. 
( TT7|) and a larger value for the polarizability on the left-hand side of Eq. ( TT7|) if a^ so is 
constant. However, the polarizability depends also explicitely on the softness dipole 
D. It is the ratio D 2 /3S which is the most important quantity relating polarizability 
and softness. Since polarizability is positive one must have the following relation 

D 2 

A,. > w (18) 



ISO 



The quantity D 2 /3S is a lower bound for the polarizability of any molecule. 

3.2 Hirshfeld partitioning applied to softness and polariz- 
abilities 

In a previous studyj^ each amino acid was separated into a backbone (fragment 
1) and a side chain (fragment 2) by applying the partitioning method proposed by 
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Contreras et al.^ In the present work, we test the dependence of the (S 1 ,^) map on 
the partitioning method by using another partitioning of the electronic properties: the 
Hirshfeld-I scheme.^ The Hirshfeld-I scheme allows to write the electronic density of 
a molecule p(r) as a sum of atomic contributions PA{ r )'- 

p(r) = ^PA(r) = ^^(r)p(r), (19) 

A A 

where the summation is over all atoms A of the molecule. The atomic weight function 
oja{*) m Eq. (fl9l) is built iteratively from the atomic densities obtained in the previous 
iteration until self consistence is reached 

The electronic density of an atom at iteration n = (initial guess) is the density of 
the isolated atom. Therefore, the initial value of the weight function u A is identical 
to the weight function of the usual non-iterative Hirshfeld scheme: 

" iw = A (21) 

The number of electrons of each isolated atom is N A — Z A — J dr p A (r). The weight 
function in Eq. (|21[) is then used to determine the number of electrons N A of each 
atom at the iteration n = 1 

N\ = Z A - q\ = Z A - J dr ^(r)p(r). (22) 

In the next iteration, the weight function oj a {t) of an atom [Eq. (120]) ] is constructed 
from its atomic density p A (r) which integrates to N\ = J dr p A (r). Because N A is 
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generally a non-integer number, p\(r) is built from a linear interpolation between the 
densities of atoms with an integer number of electrons smaller (I) and larger (u) than 

P\(r) = W " N\)pV(r) - (N\ - iV^)p^(r), (23) 

where N A ,U = J dr p° A ,u (r) and N^ 1 = f dr p^(r) are the upper and lower integer 
limits of N\, respectively. The procedure is repeated until the difference in atomic 
populations N A and iV^ _1 of two subsequent iterations n and n — 1 becomes zero. 
The self-consistent value of the weight function is noted as ua(t) hi the rest of the 
paper. 

The softness of an atom in the molecule is simply defined as: 

s A = J dr u A {r)s(r) = J dr s A (r), (24) 

and the softness of a fragment x is computed as the sum of the contributions of the 
atoms belonging to the fragment: 



The Hirshfeld-I method is applied to divide the local polarizability ay(r) into 
condensed atomic polarizabilities. The dipole of an atom A in a molecule is defined 



pA = - e J dr [ r " R 1 Mr), (26) 
The molecular dipole moment can be reconstructed exactly from atomic dipole mo- 
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ments as follows 

P = ^P A + g A eR A (27) 

A 

where qA = Za — Na is the atomic charge. 

The intrinsic condensed atomic polarizability of an atom A is defined by the 
derivative of the atomic dipole moment with respect to the electric field: 

dP, A 



< 



-ef dr [n-Rt] d -^. (28) 



The total polarizability of the molecule is reconstructed by summing over the atomic 
polarizabilities and adding the corresponding charge delocalization contribution 



QCij 



£ [< + > ( 29 ) 



where the first-order perturbed atomic charge is computed by using the relation 



The total polarizability of a fragment x is computed by summing over the intrin- 
sic polarizabilities of the atoms within the fragment and the intrafragmental charge 
delocalization contribution as follows 



(3i) 

where Q x is the position of the center of mass of fragment x. As shown in our previous 
studies,^ the definition of the charge delocalization contribution to the polarizability 

14 



of a fragment with respect to its center of mass ensures that identical fragments at 
different positions in different molecules (such as the backbone fragment in different 
amino acids or water molecules with identical hydrogen bonds in different water 
clusters^) have similar polarizability. In addition, by introducing the center of mass 



of the fragment in the definition of its polarizability Eq. (I3ip . we have shown that the 
remaining interfragmental charge derealization contribution to the polarizability of 
the molecule to which the fragment belongs, i.e. Yl x YIabx ^A^t ~ ^f)> * s nearly 
constant for molecules of similar size. 

Only the isotropic polarizabilities of fragments, defined as a third of the trace of 
their polarizability tensor, will be discussed and presented. 



3.3 Softness dipole vector in the Hirshfeld scheme 

The contribution of the "softness dipole vector" to the polarizability of an atom in a 
molecule can be computed as follows. The key quantity in Eqs. (125]) and (l3"Uj) is the 
derivative of the atomic electronic density relative to the applied electric field : 



By using Eqs. (TTTT) and (TIB"]) . Eq. ( 1521) can be written as follows 



dpA(r) 
dE j 



w A {r) I dr' 
dp A {r 



Xi(r,r') + 



sir sir 



S 



dEj 

dpA(r) 
dEj 



+ w A (r)s(r) 



J dr's(r')er'j 



S 



+ w a (t)s(t) 



s ' 



(32) 



(33) 
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in which we have defined the derivative at a constant chemical potential (first term 
of the righ-hand side of the last equality) and where Dj is the Cartesian component 
of the molecular softness dipole defined above [Eq. ([IS]) ]. Using Eqs. fl28l) and (l33jh 
the intrinsic condensed polarizability of an atom can be expressed as follows 



< 



j dre [r, - R?] [~^f\ ~ f / ' * e [r, - R?} w A (r)s(r), 
J dre[n-Rf] s A {r), 



i3 S 



= (34) 

where s A (r) has been defined above [Eq. (1241) ] and the atomic softness dipole vector 
is defined as 

Df = eJ dr [ n -R?]s A (r). (35) 

The intrinsic polarizability of an atom can thus be decomposed into a contribution 
at a constant chemical potential \x and a dipole-softness contribution 



A U-A . A.DS fnr\ 

= a# + > (36) 



with 

<#» = (37) 

On the other hand, using Eqs. (I3"U1) and f[3"3"j) . the charge induced by the electric 
field can be written as : 

5 » = _ e y dr ^) = ,5«_^. (38) 
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The respective contributions to the total polarizability of the molecule are again 
reconstructed by adding the atomic contributions and the respective charge dereal- 
ization contributions: 

+ (39) 



A 



and 



E 

A 



o A,ns_(DjS A]nA 



(40) 



Eq. ( 14T)]) allows us to compute the contribution of the dipole softness polarizability to 
the polarizability of a fragment [Eq. f )3T|) ] by summing in Eq. f HUj) over the atoms of 
a fragment: 



x,DS 



E 



. DS _[D£A\ eR A 



s 



(41) 



A3x 

It should be emphasized that in the definition of the dipole-softness polarizability 
[Eq. (T4T]) ] we do not compute the charge derealization relative to the center of mass 
of the fragment as it was done in the definition of the polarizability of a fragment 
[Eq. (151]) ]■ Indeed, the molecular dipole softness Dj is already defined relative to the 
center of mass of the molecule. On the other hand, there is no reason to enforce the 
transferability of the dipole softness contribution to the polarizability. Indeed, the 
contribution of a fragment x to the molecular dipole softness, D x - = J dr es x r, is 
expected to be different for identical fragments in different molecules on the opposite 
to afj. As will be discussed below, the local softness s\ = S — s 2 of a backbone 
(which is different from 5*i in Eq. (j5J), see Eq. (23) in Ref. ( 12 ip for more details) is not 
identical in two different amino acids but depends strongly on the relative reactivity 
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of the rest of the molecule, namely the side chain; the softness si of the backbone will 
be large in amino acids with a "hard" side chain (s2 small as in glycine) and small in 
amino acids with a "soft" side chain (s2 large as in phenylalanine). For this reason 
one can expect the dipole softness polarizability to be a non-transferable property. 

The contribution of dipole-softness to the polarizability can also be analyzed in 
an alternative way. The softness dipole vector of a molecule is easily decomposed into 
contributions of fragments in the Hirshfeld scheme. These fragment softness dipole 
vectors are related to the atomic softness dipole vectors [Eq. (135] as follows 

D x = drs x {r)er, 

= I dr SA (r)e(r-R A + R A ), 

= J2 bA + s ^ rA - ( 42 ) 

ABx 

Using the definition Eq. ffT6]) . one finds 

D = J dr Sx(r)er+ J dr s 2 (r)er, 

= D! + D 2 , (43) 

where Si and s 2 are computed by using Eq. §25§. Therefore the dipole softness 
contribution -^g-to the polarizability of a molecule [Eq. (IT7|l ] can be written as the 
sum of three terms 

-D 2 

D{ D 2 2 2 



Di.Do. (44) 
3S 3S 3S 1 2 { } 



This last equation allows to separate the dipole-softness contribution to the polar- 
izability into contributions of each fragment and a coupling term between the two 
fragments. 



3.4 Computational method 

The geometries of the twenty amino acids in their neutral form were calculated during 
a previous study using the MP2/6-311G(d,p) basis set in Gaussian03P The global 
polarizabilities were calculated using the same level of theory and basis set. The 
global softness was obtained by using the energies of the frontier molecular orbitals 
of the SCF density 

{tLUMO — tHOMo) ^ 

The dipole softness of the amino acids was computed using the Brabo package.^ 
The softnesses, polarizabilities and dipole softness of the backbone and side chain 
were evaluated using the Stock program,^ part of the Brabo package.^ The MP2 
density was used for the evaluation of the Hirshfeld-I weight function ua(t). The 
local softness sa of each atom in Eq. (liHl) was computed using the frontier-orbital 
approximation: . 



[/ dru A (r)p HO Mo(r) + J dru A (r)p LUMO (r) A 

s a ~ ] , (4oj 

* [£lumo — choaioI 

where the HOMO and LUMO densities are also extracted from the SCF density. The 
global softness of the common backbone fragment and the Coulomb hole q 1 ^ of the 
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amino acids family were calculated from Eq. (J3]) by applying a linear regression to 
the S/S2 curve. 

In the frontier-orbital approximation the softness dipole vector is computed using 



4 Results and Discussion 

The list of the twenty amino acids with their names, side chains and reactivity groups 
is given in Table 1. The amino acids are usually divided into eight different (reactivity) 



Global softness (S), local softness of the backbone (si) and side chain (32), global 
polarizability (a) and polarizability of the backbone (cti) and side chain (02) of these 
amino acids are presented in Table 2. Figure 1 depicts the global softness as func- 
tion of the local softness of the side chain. The linear correlation between these two 
quantities amounts to R = 0.9242, while the curve also demonstrates a clear sepa- 
ration of the amino acids into seven reactivity groups recovering to a large extent 
those defined in Table 1. It must be noted that the hydroxylic amino acids, as well 
as the proline amino acid, are grouped together with the aliphatic amino acids in 
the rest of the discussion of the results as the values of their softnesses are over- 
lapping. The values of the local softness S2 of the side chains follow the following 
order: alphatic<acidic<amide<basic<sulphur-containing<histidine<aromatic. At 



the BrabcP program as 




(47) 



groups according to the nature of the side chain.^ 
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first glance, this order agrees with the chemical intuition. Indeed, one expects the 
aromatic amino acids, the most polarizable members of the family, to be the "soft- 
est" . On the opposite, the aliphatic amino acids, the less polarizable molecules of the 
family, are expected to be the "hardest". This reasoning is based on the empirical 
rule that a polarizable atom is also a soft atom. One will see below that "this rule of 
thumb" must be used however with care. 

Fig. 1 can be compared to Fig. 4 in Ref. HTJ where the global softness of the amino 
acids, calculated using the same method and at identical geometries, is depicted as a 
function of the local softness of the side chain calculated using the method of Contreras 
et al. 32 Although the values of the local softness of the side chains differ from the 
values listed in Table 2 in Ref. ETJ the group separation is completely reproduced using 
the Hirshfeld-I method, demonstrating the robustness of the softness as a reactivity 
descriptor and its independence on the details of the partitioning method. 

The Coulomb hole and global (average) softness of the backbone fragment Si [Eq. 
dHJ)], derived by applying a linear regression to the curve S(s2) depicted in Fig. 1, 
are 0.7439 and 1.7113, respectively. Those values differ by only 0.5% from the values 
listed in Table 1 of Ref. being 0.7476 and 1.7196, respectively. The concept of the 
"Coulomb hole" is thus further confirmed, while its independence on the partitioning 
method is clearly demonstrated. 

On the other hand, the partitioning of amino acids into subgroups observed in 
Figure 1 and in Ref. (12TI) can be explained as follows. The global softness S in Table 
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2 shows relatively little variation among the twenty amino acids, being the smallest 
for aspartic acid (1.78 au) and largest for tryptophan (2.54 au). On the opposite, 
the partitioning of the global softness S between the fragments [Eq. f[25]) ] varies 
considerably. For instance, the local softness of the side chain s 2 varies between 0.10 
au (glycine) and 2.45 (tryptophan). A large similar variation is obviously observed 
for the softness of the backbone s\ as by definition s\ = S — s 2 . As shown in Figure 1, 
points in the (S,s) 2 map which are close identify residues belonging to the same sub- 
family. The results of Figure 1 can be summarized by saying that "similar molecules 
have similar global softness and similar contributions of their corresponding fragments 
to the global softness" . 

Figure 2 shows the strong correlation (R = 0.9975) between the global molecu- 
lar polarizability of an isolated amino acid and the polarizability of its side chain. 
Therefore the polarizability of an amino acid is largely determined by the polariz- 
ability of its side chain. On the other hand, the value of the polarizability of the 
backbone is similar for all isolated amino acids and has a value close to its aver- 
age value (28.55±0.81 au), except for proline (24.75 au) for which the side chain is 
bounded to the backbone on the contrary to the other amino acids. 

Another remarkable feature of the polarizability of an isolated amino acid is a 
strong correlation (R = 0.9659) between its polarizability and its number of electrons 
iV as can be seen in Figure 3. This correlation is a consequence of the relation between 
the polarizability and the molecular volume.^ Indeed, molecular polarizability and 
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molecular volume can be related to each other by using simple electrostatic models. 
For instance, for a spherical molecule of radius R represented by a dielectric sphere 
with a macroscopic dielectric constant e, the polarizability a sp here is given by^ 



R 3 (s 




47re V e 


+ 1. 


v ( 




16vr% V 


e + 



Sphere = X^T ( T7T ) ' ( 48 ) 

(49) 



where Eq is the vacuum permitivity and V the molecular volume. Assuming a homo- 
geneous molecular electronic density, 

N 

P = y, (50) 

one finds 

(^sphere _ 1 [ E \ tl^'\\ 

N = 16vr% p \e + T)' 1 ' 

In this model, the polarizability per electron would be constant for different molecules 
of different sizes (measured in the model by the parameter R) but having similar prop- 
erties (controled in the model by the values of the average density p and dielectric 
constant e). In the case of the amino acids, it must be noted that the approximated 
linear relation between the molecular size and the molecular polarizability is a con- 
sequence of the similarity of the examined molecules, which allows to describe them 
as a molecular family with a/N varying between 0.85 au (Asp) to 1.2 au (Trp). 

Softness and polarizability have always been intuitively related to each other as 
mentioned above. To test this relation, we have reproduced in Figure 4 the global 
polarizability per electron global a/N of the amino acids in function of their soft- 
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ness S. One observes some qualitative features, for instance, as mentioned above, 
the aromatic amino acids are the softest and the most polarizable. The correlation 
coefficient of the linear regression represented in Figure 4 is only 0.7297. Therefore, 
one cannot conclude that softness and polarizability are strictly proportional. An 
example is for instance the couple serine and isoleucine, which have nearly the same 
softness S ~ 1.82 au but different polarizabilities a/N m 0.85 [a ~ 48.21] au for Ser 
and a/N « 1.1 [a « 78.65] au for He. 

As was shown in the method section, the lack of proportionality between polariz- 
ability and softness is not surprizing since polarizability does not depend directly on 
the softness but also on the softness dipole vector through the term in Eq. (ITTj) . 
Table 3 lists the a DS contributions to the polarizabilities of the amino acids, their 
backbones (af ) and their side chains (a^ 5 ), calculated using Eq. |4"T1 

Because the partitioning of the global softness S between the fragments (back- 
bone plus side chain) varies considerably among the amino acids (Table 2), one ex- 
pects that the dipole-softness polarizability to vary significantly. The dipole-softness 
polarizability a DS ranges from -0.004 au for aspartic acid to -27.78 au for histi- 
dine. The value for histidine is also significantly larger than the rest of the values, 
the aromatic amino acids having values around -10 au. Also the distribution of 
the dipole-softness polarizability between the backbone (af 5 ) and the side chain 
{o-2 S ) differs from the distribution of the corresponding local softnesses. One ob- 
serves indeed that local softness values (Table 2) of the side chains (52) increase 



24 



in the order alphatic<acidic<amide<basic<sulphur-containirig<histidirie<aromatic, 
while the values for the backbone (s\) decrease in the same order. On the opposite, 
the variation of the values of the dipole-softness polarizability of the side chains a^ s 
or of the backbone a® s of the amino acids (Table 3) does not follow the classifica- 
tion of the residues in the biochemical families. In other words, amino acids with 
similar a^ s can be in different families as for instance Ala a^ s = —0.67 au and 
Met a^ s = —0.62 au. However one observes that absolute large values \a DS \ > 2.7 
for aromatic, amide, basic and sulfur-containing amino acids correspond also to large 
contributions of the side chains \a DS \ ~ lo^ 5 !- 

Figure 5 shows the very good correlation between the dipole-softness polarizability 
values of the amino acid and those of the side chain, with a correlation coefficient of 
R = 0.9890. This plot also allows some separation of the amino acids into reactivity 
groups, similar to Figure 1, although the order is different showing more overlap. The 
amino acids are divided into three major groups, the first one containing the acidic, 
aliphatic and amide amino acids, the second one containing the sulphur-containing, 
basic and aromatic amino acids and the third group contains histidine, which has a 
remarkably larger dipole-softness contribution than the rest. 

Table 4 contains the dipole-softness contributions partitioned using Eq. (j44]), 
where the total contribution to the molecular polarizability — is shown (identi- 
cal to the a DS in Table 3) together with the contribution of the backbone —^k, the 
side chain — & and the interfragmental contribution — 2L> }? 2 . In this partitioning, 
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the dipole-softness polarizabilities of the fragments are negative for the backbone and 
the side chain, whereas the interfragmental contribution is positive for most amino 
acids excep Gly and Ala. Remarkably, despite the difference in the two approaches 
[Eqs. (|4ip and (|44p ]. the values for the side chains listed in Tables 3 and 4 correlate 
considerably, with a correlation coefficient of R = 0.9905. Also the group separation 

D 2 

observed in Figure 6 by plotting the dipole-softness polarizabilities as function of — 
is similar to the one seen in Figure 5, with a slightly lower correlation coefficient of 
R = 0.9637. However, the contributions of the backbones, which do not become more 
negative than -3.2 au in Table 4, do not correlate with the dipole-softness polariz- 
ability values of the backbones reported in Table 3. The interfragmental contribution 
does not reflect the biochemical classification. 

One can conclude from Tables 3 and 4 that dipole-softness polarizability is a 
descriptor which permits to group the amino acids by similarity of their dipole-softness 
polarizabilities but this classification does not reflect any biochemical classification. 

5 Conclusions 

The molecular family of amino acids was studied by defining and computing the 
local softness and the local polarizability of their side chains. The concepts of the 
local softness of a fragment and of the Coulomb hole of a molecular family, that 
were introduced in Ref. [2TJ were found independent of the partitioning method of the 
electronic properties: both the Contreras et al. method (Ref. [21~|) and the Hirshfeld- 
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I method (present work) gave similar results. One concludes that the softness of 
the side chains allowes to separate the amino acids into groups reflecting their usual 
biochemical classification. Amino acids within of the same biochemical family are 
close to each other in the map (S, S2) (Fig. 1) which means that similar molecules 
have both similar global and local softnesses. 

The polarizability of the twenty amino acids and of their side chains were also 
computed. The global polarizability was found approximately proportional to the 
number of electrons N of the amino acid. Indeed, a/N varies between 0.85 au (Asp) 
to 1.2 au (Trp) within the whole set of amino acids. These results reflect the well- 
known proportionality between molecular polarizability and molecular size or volume 
[See Eq. flHD]. Consequently, the polarizability of the side chains does not reflect the 
biochemical classification of the amino acids on the contrary of their local softness. 

The statement that systems which are more polarizable than other are also softer 
was tested by comparing the softness and polarizability of the side chains of the amino 
acids. A very moderate correlation was found to exist between global softness and 
global polarizability per electron. It is shown that the global softness S of two amino 
acids may be identical whereas their polarizability a differ significantly. 

Finaly, a relation was derived between the local polarizability of a molecule and 
its local softness by applying the Berkowitch-Parr relation. The key quantity is the 
"softness dipole vector" of a fragment in Eq. (JT5]) . We found that the dipole-softness 
contribution to polarizability exhibits very different behavior than the local soft- 
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ness, explaining the lack of proportionality between softness and polarizability. The 
softness dipole vector contribution to the polarizability, as well as the polarizability 
itself, does not reflect the biochemical classification of the amino acids: for example, 
acidic and amide amino acids have as low values of dipole-softness contribution to 
the their polarizability as aliphatic amino acids, while histidine has a value of the 
dipole-softness contribution to the its polarizability almost three times as large as the 
value for aromatic amino acids. 

From the results presented in this paper one can conclude that the often assumed 
relation between softness and polarizability is not as straightforward as one might 
think, and that although polarizability and dipole softness undoubtedly reflect some- 
how the reactivity of a molecule, their values are not good descriptors of the molecular 
similarity, in contrast to softnesses. 
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7 Tables 



IName 


Abbrevation 


Side Chain 


Group 


Aspartic Acid 


A 

Asp 


-CH 2 COOH 


A * J * 

Acidic 


Glutamic Acid 


Glu 


-CH 2 CH 2 COOH 


A * J * 

Acidic 


A 1 

Alanine 


A 1 

Ala 


-CH 3 


A 1 * 1 x * 

Aliphatic 


Glycine 


r\ i 

Gly 


-H 


A 1 * "U J- ' 

Aliphatic 


T 1 

Isoleucme 


He 


CH(CH 3 )CH 2 CH3 


A 1 * 1 j_ * 

Aliphatic 


Leucine 


T 

Leu 


-ch 2 ch(ch 3 ) 2 


A 1 * U -i- * 

Aliphatic 


Proline 


T~) 

rro 


-C3H 6 (RING) 


All 1 ^ , j * , 

Aliphatic 


Serine 


ber 


-CH 2 OH 


TT J 1 • 

Hydroxyhc 


Threonine 


Inr 


-ch(oh)ch3 


Hydroxylic 


Valine 


Val 


-ch(ch 3 ) 2 


A 1 * "U J- * 

Aliphatic 


Asparagme 


A 

Asn 


-CH 2 CONH 2 


A ' 1 

Amide 


Glutamine 


Gin 


-CH 2 CH 2 CONH 2 


Amide 


Phenylalanine 


rtie 


-CH 2 ^PHENYL ) 


Aromatic 


Tryptophan 


Trp 


-ch 2 (c 2 h 2 n) (phenyl) 


Aromatic 


Tyrosine 


Tyr 


-ch 2 (phenyl)oh 


Aromatic 


Arginine 


Arg 


-ch 2 ch 2 ch 2 nhc(nh)nh 2 


Basic 


Lysine 


Lys 


-CH 2 CH 2 CH 2 CH 2 NH 2 


Basic 


Histidine 


His 


-ch 2 c 3 n 2 h 3 (ring) 


Histidine 


Cysteine 


Cys 


-CH 2 SH 


Sulphur-containing 


Methionine 


Met 


-CH 2 CH 2 SCH 3 


Sulphur-containing 



Table 1: The names, abbrevations, and formula's of the side chain and of the reactivity 
groups of the twenty amino acids examined in this study. 
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Name 




S 


Sl 


s 2 


a 


Q 


■i 


a 2 


Asp 


1 


.78 


1 


.12 


0.66 


59.76 


27 


.99 


21.14 


Glu 


1 


.88 


1 


.37 


0.51 


70.38 


28 


.48 


31.69 


Ala 


1 


.85 


1 


.56 


0.29 


45.52 


29 


.83 


9.55 


Gly 


1 


.84 


1 


.74 


0.10 


34.45 


30 


.46 


1.09 


lie 


1 


.82 


1 


.40 


0.42 


78.65 


27. 


.88 


38.29 


Leu 


1 


.85 


1 


.38 


0.46 


78.90 


28. 


.58 


38.82 


Pro 


1 


.88 


1 


.33 


0.56 


63.26 


24. 


.75 


27.41 


Ser 


1 


.82 


1 


.48 


0.34 


48.21 


29 


.16 


11.76 


Thr 


1 


.84 


1 


.34 


0.50 


59.31 


27 


.77 


22.41 


Val 


1 


.81 


1 


.42 


0.39 


67.57 


27 


.84 


29.20 


Asn 


1 


.84 





.94 


0.90 


64.08 


27 


.93 


25.21 


Gin 


1 


.87 


1 


.15 


0.72 


75.27 


27 


.99 


34.26 


Phe 


2 


.22 





.14 


2.08 


104.59 


28 


.00 


61.31 


Trp 


2 


.53 





.08 


2.45 


132.92 


27 


.76 


89.30 


Tyr 


2. 


.34 





.12 


2.23 


109.75 


28. 


.09 


65.17 


Arg 


2 


.12 





.71 


1.41 


100.14 


27. 


.74 


54.89 


Lys 


1 


.91 





.67 


1.24 


86.51 


28. 


.72 


42.46 


His 


2 


.19 





.04 


2.15 


84.83 


27. 


.35 


42.55 


Cys 


2. 


.00 





.35 


1.66 


62.03 


29 


.10 


23.78 


Met 


2 


.10 





.58 


1.52 


83.96 


29 


.15 


44.78 



Table 2: Global softness S (Eq. (14"5"|) ). local softness of the backbone and side chain si 
and $2 (Eq. fl25l)). calculated from the SCF density using the 6-311G(d,p) basis set, 
and global polarizability a and local polarizabilities of the backbone and side chain 
cti and ct2 (Eq. fl31|) ) of the twenty amino acids, calculated at the MP2/6-311G(d,p) 
level of theory. All quantities are in au. 
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Name 








Asp 


-0.04 


-0.15 


0.11 


Glu 


-1.76 


-2.05 


0.30 


Ala 


-0.98 


-0.67 


-0.31 


Gly 


-1.65 


-1.50 


-0.16 


He 


-2.69 


-2.67 


-0.02 


Leu 


-2.09 


-2.24 


0.15 


Pro 


-1.47 


-0.86 


-0.61 


Ser 


-2.12 


-1.88 


-0.25 


Thr 


-1.51 


-1.19 


-0.32 


Val 


-1.99 


-1.79 


-0.21 


Asn 


-1.46 


0.59 


-2.05 


Gin 


-1.14 


-1.38 


0.24 


Phe 


-11.18 


0.48 


-11.66 


Trp 


-9.82 


0.23 


-10.05 


Tyr 


-9.19 


0.35 


-9.54 


Arg 


-6.07 


1.22 


-7.28 


Lys 


-7.89 


1.25 


-9.14 


His 


-27.78 


0.17 


-27.95 


Cys 


-8.91 


0.04 


-8.96 


Met 


-5.94 


-0.62 


-5.31 



Table 3: Global dipole-softness polarizability a , local dipole-softness polarizabili- 
ties of the backbone af s and side chain a^ s of the twenty amino acids, calculated 
from the SCF density obtained with the 6-311G(d,p) basis set, using Eq. (14TI) . All 
quantities are in au. 
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Name 


D 2 


Df 


Dl 


2DiD 2 


?,S 


3.<? 






Asp 


-1.46 


-1.68 


-1.43 


3.07 


Glu 


-1.76 


-3.03 


-0.68 


1.95 


Ala 


-0.99 


-0.63 


-0.27 


-0.08 


Gly 


-1.66 


-1.38 


-0.04 


-0.23 


lie 


-2.69 


-3.23 


-0.59 


1.13 


Leu 


-2.09 


-2.97 


-0.58 


1.46 


Pro 


-1.47 


-1.06 


-0.82 


0.40 


Ser 


-2.13 


-1.91 


-0.29 


0.07 


Thr 


-1.51 


-1.44 


-0.57 


0.49 


Val 


-1.99 


-2.19 


-0.61 


0.81 


Asn 


-0.04 


-1.00 


-3.65 


3.19 


Gin 


-1.14 


-2.87 


-1.26 


2.98 


Phe 


-11.18 


-0.02 


-12.16 


1.00 


Trp 


-9.82 


-0.01 


-10.28 


0.47 


Tyr 


-9.19 


-0.02 


-9.91 


0.73 


Arg 


-6.07 


-2.36 


-10.86 


7.16 


Lys 


-7.89 


-1.49 


-11.88 


5.49 


His 


-27.78 


0.00 


-28.12 


0.34 


Cys 


-8.91 


-0.09 


-9.09 


0.27 


Met 


-5.94 


-0.92 


-5.61 


0.60 



Table 4: Global dipole-softness polarizability — dipole-softness polarizabilities of 

the backbone —jg, side chain — ^§ and the interfragmental contribution — 2r> ^ 2 
of the twenty amino acids, calculated from the SCF density obtained with the 6- 
311G(d,p) basis set using Eq. (144]) . All quantities are in au. 
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8 Captions 

Figure 1: The global softness S of the amino acids as function of the local softness s 2 
of the side-chain, calculated from the SCF density using the 6-311G(d,p) basis set. 
All values are depicted in au. 

Figure 2: The global polarizability a of the amino acids as function of the local 
polarizability a 2 of the side-chain [Eq. (J3"T]) ]. calculated at the MP2/6-311G(d,p) 
level. All values are depicted in au. 

Figure 3: The linear dependence of the polarizabilities of amino acids a, calculated 
at the MP2/6-311G(d,p) level, on the number of electrons N. 

Figure 4: The polarizability per electron a/N, calculated at the MP2/ 6-311G(d,p) 
level, as function of the global softness 5* of the amino acids, calculated from the SCF 
density obtained with the 6-311G(d,p) basis set. All values are depicted in au. 
Figure 5: The global dipole-softness polarizability — a DS as function of the the lo- 
cal dipole-softness polarizabilities of the side chain — a^ s ■, calculated from the SCF 
density obtained with the 6-311G(d,p) basis set. All values are depicted in au. 
Figure 6: The global dipole-softness polarizability as function of the the local 

D 2 

dipole-softness polarizabilities of the side chain -^J-, calculated from the SCF density 
obtained with the 6-311G(d,p) basis set. All values are depicted in au. 
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9 Figures 



3.0 -i 



2.8 - 




1.6 - 

1.4 - 
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0.0 0.5 1.0 1.5 2.0 2.5 
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■ Aliphatic O Acidic ▲ Amide □ Basic • Sulphur ♦ Histidine A Aromatic 



Figure 1: The global softness S of the amino acids as function of the local softness S2 
of the side-chain, calculated from the SCF density using the 6-311G(d,p) basis set. 
All values are depicted in au. 
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Figure 2: The global polarizability a of the amino acids as function of the local 
polarizability a 2 of the side-chain [Eq. (l3"Tj) ]. calculated at the MP2/6-311G(d,p) 
level. All values are depicted in au. 
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Figure 3: The linear dependence of the polarizabilities of amino acids a, calculated 
at the MP2/6-311G(d,p) level, on the number of electrons N. 
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0.95 - ♦ * 
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S (au) 

Figure 4: The polarizability per electron ot/N, calculated at the MP2/ 6-311G(d,p) 
level, as function of the global softness S of the amino acids, calculated from the SCF 
density obtained with the 6-311G(d,p) basis set. All values are depicted in au. 
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O Acidic ■ Aliphatic A Amide • Sulphur □ Basic A Aromatic ♦ Histidine 



Figure 5: The global dipole-softness polarizability — a DS as function of the the lo- 
cal dipole-softness polarizabilities of the side chain — a® 3 , calculated from the SCF 
density obtained with the 6-311G(d,p) basis set. All values are depicted in au. 



42 



30 

25 - 
20 - 



8 ^ 



10 



5 - 



□ 



10 



15 



20 



25 



30 



D 2 2 /3S (au) 

O Acidic ■ Aliphatic ▲ Amide ©Sulphur □ Basic A Aromatic ♦Histidine 



Figure 6: The global dipole-softness polarizability ^ as function of the the local 

D 2 

dipole-softness polarizabilities of the side chain ^J-, calculated from the SCF density 
obtained with the 6-311G(d,p) basis set. All values are depicted in au. 
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